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(54) Motion picture generation from a static digital image 



(57) A computer system 1 comprises a processor 3 
connected to a memory 5. The processors receives im- 
age data, representing a displayable static photograph, 
through an input port 7, generates a set of video data 
by means of an application program 11 and outputs the 
video data to a display 9. The video data generated by 
the application program 1 1 provides a displayable video 
programme representing a displayable moving view- 



point over the static photograph. The application pro- 
gram 11 initially analysis the displayable content of the 
photograph and identifies characteristics, such as re- 
gions of interest or dominant lines, curves or edges, and 
depending on the characteristics, identifies the photo- 
graph as belonging to a particular image class. The vid- 
eo data is generated in accordance with rules for that 
particular image class. 
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Description 

[0001 ] This invention relates to a method of displaying 
a digital image. 

[0002] Static images are increasingly being stored in 5 
digital form, for example in computer memories. This is 
partly due to the increase in use of computer scanners 
and digital camera equipment. One of the main advan- 
tages of providing an image, e.g. a photograph, in digital 
form, is that it can be edited on a computer, displayed 10 
on a computer screen, or, with suitable equipment, dis- 
played on a television screen. 

[0003] Owing to the fact that such digital images (as 
with all static images) inherently hold a viewer's atten- 
tion for a limited period of time, methods have been de- is 
vised for producing a moving image from a static image. 
Such methods are commonly referred to as 'rostrum 
camera' techniques. A conventional rostrum camera is 
a film or television camera mounted vertically on a fixed 
or adjustable column, typically used for shooting graph- 20 
ics or animation - these techniques for producing mov- 
ing images are of the type that can typically be obtained 
from such a camera. Essentially, these techniques in- 
volve different parts of the static image being displayed 
over time to provide an overall motion effect 'over' the 25 
image. For example, a perceivable panning motion from 
one part of the image to another might be employed. As 
a further example, a zooming effect might be used. The 
main purpose of using such rostrum camera techniques 
is to generate continued interest in an image by convert- 30 
ing a static image into a moving image (or rather, a se- 
ries of consecutive static images so arranged to repre- 
sent motion). Rostrum camera techniques also have ad- 
vantages in terms of displayable resolution. By moving 
over an image, detail of an image can be show that oth- 35 
erwise cannot be shown on a low resolution display 
(without zooming-in). 

[0004] Such conventional rostrum camera techniques 
are generally manual, in that they require a user to ob- 
serve the displayed image, and manually to plot the path 40 
or order of the image parts to be displayed in a moving 
sequence. This can be time consuming, and requires 
some technical knowledge of camera equipment or dig- 
ital image editing software. In order to provide a good- 
quality moving image, particularly for photographs of re- 45 
al-life scenes, some knowledge of photographic compo- 
sition is also required. This is clearly disadvantageous 
to an average person who wishes to add some degree 
of interest and variation to, say, a still photograph. 
[0005] Basic automation of rostrum camera tech- 50 
niques are provided in a few digital image editing pack- 
ages, such as 'Photo-Vista' from MGI Software Corp. 
Such a package provides a virtual moving viewpoint of 
a static digital image by moving from one side to the oth- 
er, by moving in a wave like motion, or by displaying ran- 55 
dom parts (much like a screensaver slide-show). How- 
ever, no account is taken of the image content, and no 
photographic composition (even at a basic level) is ac- 



counted for. 

[0006] In many conventional techniques, computer 
memory is taken-up with displaying uninteresting parts 
of images and of those image parts that are considered 
significant or interesting, only small sections may be 
shown with a 'cutting-off' effect. Also, in terms of digital 
examples, large amounts of time and computational re- 
sources are used. 

[0007] According to a first aspect of the present inven- 
tion, there is provided a method of displaying a digital 
image, the method comprising: acquiring a set of image 
data representative of a displayable static image; per- 
forming an analysis of the image data using a process- 
ing means to identify characteristics of the image con- 
tent; and generating, in the processing means, a set of 
video data for output to a display, the video data repre- 
senting displayable motion overthe static image and be- 
ing generated in accordance with the image content 
characteristics. 

[0008] The method provides an effective way of auto- 
matically producing a moving viewpoint over an other- 
wise static image, with the video data (representing the 
moving viewpoint) being dependant on characteristics 
of the image content. Thus, what is actually shown in 
the static image is taken into account when generating 
the moving viewpoint. This provides for a much more 
effective video sequence and allows the layman to pro- 
duce highly effective results without any knowledge of 
photographic composition. Computer memory is used 
more efficiently to display those parts of the image in 
which the person viewing is likely to be interested. 
[0009] It will be appreciated that the video data gen- 
erated can be of any digital video format, and may make 
use of some of the more structured or interactive video 
playback formats which are emerging in the field. The 
video data need not simply relate to a stream of frames 
which are played sequentially, but could also relate to 
video programmes where a viewer can browse the pro- 
gramme in a non-sequential manner. This is possible in 
a format called Synchronised Media Integration Lan- 
guage (SMIL). This format allows identification of clips 
and other structure so that the viewer can jump around 
the video. Preferably, the step of performing an analysis 
of the image data comprises determining which of a 
number of predefined image characteristics are present 
in the image, and the step of generating the video data 
comprises executing an algorithm associated with those 
characteristics identified, the algorithm defining a rule 
or rules for generating a moving viewpoint over the im- 
age for display. 

[0010] The step of performing an analysis of the im- 
age may further comprise identifying a predefined im- 
age class wherein, in that image class, sub-parts of the 
image have predefined characteristics, and establishing 
index frames based on a close-up view of each identified 
sub-part, the step of generating the video data compris- 
ing executing an algorithm for determining a display 
path from one index frame to the next. In the step of 
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generating the video data, the algorithm may further de- 
termine one or more of (a) the order of the index frames 
to be displayed, (b) the amount of time for which each 
index frame is displayed, and (c) the nature of the tran- 
sition between each index frame. s 
[0011] The step of identifying the predefined image 
class having sub-parts with predefined characteristics 
may comprise identifying regions of interest and per- 
forming a feature recognition operation. The step of per- 
forming feature recognition may identify human facial 10 
features, the step of establishing index frames thereaf- 
ter comprising forming index frames based on a close- 
up view of the facial features. Having identified human 
facial features, the step of performing feature recogni- 
tion may further comprise comparing the facial features *s 
with a database of pre-stored facial features such that 
the step of forming index frames is performed only for 
those facial features already present in the database. 
The step of generating the video data to establish a dis- 
play path can comprise determining the orientation of 20 
the facial features, and generating a display path which 
follows the general gaze direction which the facial fea- 
tures exhibit. 

[0012] As an alternative to the index-frame method 
above, the step of performing an analysis of the image 25 
may further comprise identifying a predefined image 
class wherein, in that image class, there are one or more 
dominant edges, lines or curves, the step of generating 
the video data comprising executing an algorithm for de- 
termining a display path following the one or more dom- 30 
inant edges, lines or curves. 

[0013] Further, the step of performing an analysis of 
the image data may further comprise: (a) identifying a 
predefined image class wherein, in that image class, 
there are both (i) image sub-parts having predefined 35 
characteristics, and (ii) dominant edges, lines or curves; 
and (b) establishing index frames based on a close-up 
view of each identified image sub-part in (i), the step of 
generating the video data comprising executing an al- 
gorithm for determining a display path moving between 40 
each index frame and following the dominant edges, 
lines or curves. 

[0014] In the step of generating the video data, the 
algorithm may define rules having a first level and at 
least one sub-level, the rules in the first level relating to 45 
identification of a predefined image class, and the rules 
in the at least one sub-level relating to options for gen- 
erating the moving viewpoint for the image class identi- 
fied. The method may further comprise prompting the 
user manually to select an option in a sub-level. so 
[0015] The step of generating the video data may 
comprise generating video data for a plurality of video 
sub-clips, each sub-clip representing displayable mo- 
tion over a different part of the static image, and wherein 
the method further comprises an editing step for linking 55 
the sub-clips to form a second set of video data. 
[0016] The above method finds particular application 
where the image data is representative of a displayable 



photograph. 

[0017] According to a second aspect of the present 
invention, there is provided a computer program stored 
on a computer-usable medium, the computer program 
comprising computer-readable instructions for causing 
the computer to execute the steps of: acquiring a set of 
image data representative of a displayable static image; 
performing an analysis of the image data using a 
processing means to identify characteristics of the im- 
age content; and generating, in the processing means, 
a set of video data for output to a display, the video data 
representing displayable motion over the static image 
and being generated in accordance with the image con- 
tent characteristics. 

[0018] According to a third aspect of the present in- 
vention, there is provided a computer system compris- 
ing a processing means, a data port and a video port, 
the processor being arranged to receive image data rep- 
resentative of a displayable static image from the data 
port, and wherein the processing means is further ar- 
ranged to access and to perform an analysis of the im- 
age data to identify characteristics of the image content, 
and to generate a set of video data representing display- 
able motion over the static image according to the image 
content characteristics, the processor being arranged to 
output the video data to the video port for display. 
[0019] The invention will now be described, by way of 
example, with reference to the accompanying drawings 
in which: 

Figure 1 is a block diagram representing a computer 
system for running a computer program according 
to the invention; 

Figure 2 is a block diagram representation of a set 
of rules within the computer program for generating 
video data from a static image; 

Figures 3a to 3d represent the stages of forming a 
video programme from a static photograph depict- 
ing a group of people; 

Figure 4 represents a photograph showing a land- 
scape scene; and 

Figure 5 represents a photograph showing a land- 
scape scene with a subject being in the foreground. 

[0020] A first embodiment of the invention is shown in 
Figure 1 . Referring to Figure 1 , a computer system 1 
comprises a processor 3 connected to a memory 5. The 
processor3 receives data via an input port 7 and outputs 
data to a display 9, such as a monitor. In this case, the 
input port 7 receives image data from an image source, 
for example a scanner or a digital camera. The image 
data received represents a static image, here a photo- 
graph, and is stored in the memory 5. Also stored in the 
memory 5 is an application program 11 for generating a 
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set of "video data" from the image data. The video data 
generated by the application program 11 provides a dis- 
playable "video programme" representing a displayable 
moving viewpoint over the static image. This video pro- 
gramme can thereafter be viewed on the display 9. It will 
be appreciated that such a video programme will actu- 
ally be composed of a plurality of separate static images 
so sequenced to give the motion effect. For ease of ex- 
planation the terms video data and video programme 
will be referred to. 

[0021] Once the acquired image data is stored in the 
memory 5, the application program 11 is launched, and 
the location of the file holding the image data specified 
to the program. The program 11. operating under the 
control of the processor 3, then performs an analysis of 
the image data such that particular characteristics of the 
image content (i.e. that content which can be viewed in 
two dimensions on a display) are identified. The appli- 
cation then proceeds to generate the abovementioned 
video data based on the identified image content char- 
acteristics. To facilitate this, the application program 11 
includes an algorithm comprising a set of rules for de- 
termining how the video programme should display a 
moving viewpoint based on the different characteristics 
identified. 

[0022] In the present embodiment, the analysis stage 
of the application program 11 initially identifies which 
one of a plurality of predefined image classes the image 
data stored actually represents. Given that the intended 
image data relates to photographs, three image classes 
are provided in the program 1 1 , namely (a) a face/group 
traversal class, (b) a landscape traversal class, and (c) 
a combined face/landscape traversal class. 
[0023] In (a), i.e. the face/group traversal class, the 
analysis stage of the application program 1 1 uses stand- 
ard image analysis techniques to identify so-called 're- 
gions of interest'. Such techniques effectively involve 
processing the image data to decompose and find 'in- 
teresting 1 points within the image. As an example, such 
a technique might include identifying and segmenting 
regions having substantially uniform texture and colour. 
A further known method is to perform face detection 
whereby human facial features are identified. This latter 
method is used in the application program 11 to identify 
facial features within the image data belonging to one 
or more persons captured in a photograph. Assuming 
one or more faces are identified, the program 1 1 estab- 
lishes 'index frames' of those faces based on a zoomed- 
in view of the face. 

[0024] in the next phase of the application program 
11 , i.e. the video data generation stage, a display path 
is established, which effectively represents the moving 
viewpoint over or between the one or more index frames 
that the video data will produce when displayed. As ex- 
plained briefly above, the application program 11 com- 
prises an algorithm, or set of rules, many of which are 
based on photographic composition techniques for pro- 
ducing appropriate and interesting effects according to 



the image content. Thus, for example, where one face 
is identified in an image, the video data may represent 
an overview shot, showing the entire image, followed by 
azooming-in effect to the face. Where one or more faces 
5 are identified, the overview may be followed by a pan- 
ning motion between each index frame (hence group 
traversal class), if necessary, incorporating zooming-in 
and zooming-out effects to take account of some faces 
being closer than others. Of course, various other rules 
10 may be applied to obtain suitable motion effects. These 
will be discussed in more detail below. 
[0025] In (b), i.e. the landscape traversal class, the 
analysis stage of the application program 11 identifies 
dominant edges, curves or lines in an image. For exam- 
's pie, in a landscape scene having hills and mountains in 
the distance, the program 11 identifies the dominant 
edges, curves or lines (i.e. as being significantly more 
dominant than any particular region of interest, if any 
are present) and so knows to operate in this class in the 
20 video data generation stage. In this stage, the video data 
generated will follow the general path of the dominant 
edges, curves or lines in the image (hence landscape 
traversal class). As with the group traversal class, other 
rules may be applied to provide more sophisticated mo- 
25 tion effects. 

[0026] In (c), i.e. the combined face/landscape traver- 
sal class, the analysis stage identifies image data rep- 
resenting an image having both (i) one or more regions 
of interest and (ii) a dominant edge, line or curve. In ef- 
30 feet, it combines the analysis stages of (a) and (b) 
above. In this case, the video data generation stage will 
plot a path for showing the identified regions of interest 
whilst also following the general path of the edge, line 
or curve. Th is is a slightly more complex task, but all the 
35 same, provides for interesting moving image effects. 
[0027] Figure 2 shows a block diagram representing 
the general operation of the application program 1 1 . As 
described above, in the initial analysis stage, the pro- 
gram 11 establishes into which of the three classes the 
40 image data falls. Having performed this, the video gen- 
eration stage will proceed to operate according to the 
predefined algorithm, or set of mles belonging to the 
class identified (and represented as blocks 13, 15 and 
1 7). Sub-rules are provided which are used to determine 
45 how the video data will be generated in that class. In 
Figure 2, sub-rules are shown and described with re- 
spect to the group traversal class only, although it will 
be appreciated that suitable sub-rules may be formulat- 
ed forthe landscape traversal and combined face/land- 
50 scape transversal classes. 

[0028] An example image is shown in Figure 3a, the 
image representing a photograph 27 of a group of peo- 
ple, in the analysis stage of the application program 11 , 
regions of interest are identified in the image firstly by 
55 identifying areas having consistent colour and texture, 
and then by identifying the facial features. The program 
1 1 then makes the decision that this image must belong 
to the group traversal class since it contains a number 
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of distinct regions of interest (although it should be noted 
that even a single face or other region of interest would 
fall into this class also). Having decided on the appro- 
priate image class, the program 11 proceeds to gener- 
ate a video programme based on the sub-rules shown 
in Figure 2. 

[0029] Referring back to Figure 2, the first sub-rule 19 
in the group traversal class relates to index frame se- 
lection. This selection is used to derive a set of still 
frames, or index frames. Having identified that there are 
a number of faces in the analysis stage, the obvious 
choice is to use a close-up version of each face as the 
index frames. This is illustrated in Figure 3b, where in- 
dex frames are represented as 27a - 27d. However, it 
should be appreciated that the choice of index frames 
may encompass more sophisticated selection rules 
based on photographic composition techniques. These 
would comprise extra sub-levels (not shown) depending 
from the index frame selection rule in Figure 2. Such 
selection techniques are described in the applicant's co- 
pending international Patent Application No. 
GB01/05683 entitled "Automated Cropping of Electronic 
Images", filed on 20 December 2001 , the contents of 
which are incorporated herein by reference. Image con- 
tent features that are relevant to composition include 
face identification (used in this example), person iden- 
tification (i.e. comparing an identified face with a data- 
base of pre-stored faces and using only recognised fac- 
es as index frames), gaze direction (faces appearing to 
look at each other providing interesting subjects), avoid- 
ing flat, boring and strongly contrasting areas, avoiding 
light areas adjacent the edges of the image etc. 
[0030] The second sub-rule 21 relates to the order in 
which the index frames will be shown. In the example 
shown, the rule 21 simply stipulates that a panning effect 
will take place from leftto right, passing over each index 
frame in turn. This is shown in Figures 3c and 3d. Since 
the face on the extreme right hand side of the image is 
nearer, and so bigger, a zooming effect is used. 
[0031] Further examples include the order following 
other rules of composition, for example, starting with an 
overview shot encompassing several regions of inter- 
est, and then showing detailed views of each region of 
interest. The overview shot sets the location and gives 
atmospheric effect. Another approach might withhold 
the location information from the user or audience to 
give some dramatic effect, the individual index frames 
being shown first before revealing the overview shot, i. 
e. by zooming-out. The ordering of index frames could 
follow the geometry of the scene. If the scene had shown 
a group of people in rows, e.g. a school photograph, the 
order could start from the top left of the scene, move 
rightwards, then down, and then leftwards to end at the 
bottom left person. Detecting the gaze of people (men- 
tioned above) in the scene can be used to plot a suitable 
path over the image. Some degree of suspense can be 
generated since the viewer will wonder what the person 
is looking at. The part of the image the people are actu- 



ally looking at can be kept out of view until the end. 
[0032] The main effects which generate reaction from 
a viewer, and so are taken into account in this sub-rule, 
include panning for indicating spatial relationships be- 

5 tween two or more index frames (cutting effects do pro- 
vide the same sense of continuity). Tilting upwards cre- 
ates feelings of rising interest and emotion, expectancy, 
hope or anticipation. Tilting downwards is allied to low- 
ering of interest and emotion, disappointment, sadness 

10 or critical inspection. Zooming-in or zooming-out indi- 
cates the viewer's behaviour towards the subject. 
Rhythmic or jerky zooms should be avoided, so any 
zooming effect should be used with care. 
[0033] The third sub-rule 23 determines the time pe- 

*5 riod for which each index frame 27a - 27d is displayed. 
As the video programme moves between the index 
frames 27a - 27d, the camera will linger at each index 
frame priorto moving to the next, in this case rightwards, 
index frame. The longer the time period, the more time 

20 the viewer is able to look at the detail in the index frame 
27a - 27d. Ultimately, the time spent viewing each index 
frame 27a - 27d will depend on the number of index 
frames and the total time allotted to the video, say 20 
seconds. For example, each index frame 27a - 27d 

25 could be viewed static for approximately 2 seconds be- 
fore the viewpoint moves to the next index frame. Of 
course, it is possible to use repeated views of the index 
frames 27a - 27d. 

[0034] The fourth sub-rule 25 determines the nature 

30 of the movement between the index frames 27a - 27d. 
This movement can vary according to the route (deter- 
mined by the order) and the rate of movement (to some 
extent, determined in the previous stage). In Figure 3c, 
the motion between the index frames 27a - 27d is 

35 formed by straight lines between the centre of each in- 
dex frame. In Figure 3d, a smoother, less jerky path is 
formed. The path could follow some structure in the im- 
age. Another approach is to use a single line as a path 
through the people, with the size of the frame sufficient 

40 to capture all of the people of interest whilst panning the 
scene. In this sense, displayable resolution is compro- 
mised to some extent for smooth camera motion. A sim- 
ple way of performing this would be to fit a line through 
a series of points (known as 'least squares fitting'). So, 

45 if a group of people were shown, a point could be made 
on each person (say, their left eye). Next, a frame size 
is used which is sufficient to include each of the people 
(plus some margin space) when the camera viewpoint 
is at the point on the line closest to the fitted line. Com- 

50 plexity occurs when there are multiple rows of people of 
different sizes. One might try to fit several lines. If the 
variation in size is too great, then the frame size will be- 
come too large and the effect of seeing detail will be lost. 
[0035] Other arrangements for panning between peo- 

55 pie or faces can be arranged. For example, if the appli- 
cation program 11 recognises people around an object, 
such as a table (e.g. with people along both longitudinal 
sides of the table, and at the opposite end to the view- 
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point, then fitting horizontal lines to the image will result 
in a video programme that zig-zags across the table. 
However, by fitting three sides of a trapezium to points 
on the people, the camera viewpoint will move around 
the table, producing a much more desirable result. 
[0036] Generally speaking, the speed of transition be- 
tween index frames may depend on the amount of in- 
terest in the regions between index frames. Panning 
slowly across uninteresting areas can prove annoying 
to the viewer. The panning should be smooth, with er- 
ratic or hesitant panning avoided. A fast pan moving rap- 
idly from one index frame to the next causes the inter- 
mediate region to appear blurred. As the viewer's atten- 
tion is moved rapidly to the next index frame, it gives it 
(the next frame) a transitory importance. 
[0037] A further example image is shown in Figure 4, 
the image representing a photograph 29 of a mountain 
range. In the analysis stage of the application program 
1 1 , no identifiable regions of interest are identified in the 
image, however, a dominant line is detected in the form 
of the mountain extremes contrasting with the back- 
ground. The program 11 then makes the decision that 
this image must belong to the landscape traversal class 
since it contains no distinct regions of interest but there 
is a dominant line. Having decided on the appropriate 
image class, the program 1 1 proceeds to generate a vid- 
eo programme based on sub-rules (not shown in Figure 
2). At the most basic level, the video programme shows 
a panning motion following the dominant edge (i.e. a zig- 
zag motion). Sub-rules may determine the direction and 
speed of motion. 

[0038] A third example image is shown in Figure 5, 
the image representing a photograph 31 of a mountain 
range with a person positioned in the foreground. In the 
analysis stage of the application program 11, one iden- 
tifiable region of interest is identified in the image, name- 
ly the face. However, a dominant line is also detected in 
the form of the mountain extremes contrasting with the 
background (as in the previous example). The program 
11 makes the decision that this image belongs to the 
combined face/landscape traversal class. Having decid- 
ed on the appropriate image class, the program 11 pro- 
ceeds to generate a video programme based on suitable 
sub-rules (not shown in Figure 2). Again, the video pro- 
gramme may show a panning motion following the dom- 
inant edge which zooms-out to shown the index frame 
of the face. Alternatively, the reverse motion can be ap- 
plied. 

[0039] In the above embodiment, the application pro- 
gram 11 produces a video programme automatically 
from the image data. However, in second and third em- 
bodiments, some interaction is provided in the applica- 
tion program 11. This interaction allows a user to ana- 
lyse and assist the specification of the moving viewpoint. 
[0040] In the second embodiment (first interactive ex- 
ample), the application program 11 is used (in its auto- 
matic mode) to produce an initial video programme. This 
video programme is then used for analysis purposes 



and subsequent interaction. For example, index frames 
can be added or removed. The order of traversal of the 
index frames can be changed. The zoom around the in- 
dex frames can also be altered. The application program 

5 11 provides an interface whereby these interactive op- 
tions can be made. For example, a button can be used 
to signal that the viewable frame should zoom-in or 
zoom-out on a particular position. The rate of movement 
can also be altered. A button could also be used to 

io speed up or to slow down the movement between index 
frames. 

[0041] In the third embodiment (second interactive 
example), having decided on the image class in the 
analysis stage, instead of proceeding to generate the 

*5 video data, the user is prompted to make certain deci- 
sions with respect to the sub-rules shown in Figure 2. 
For example, the user may indicate a point of interest in 
the image, the system thereafter using segmentation to 
deduce the region of interest and the required frame. 

20 The automated part of the application program 11 can 
then calculate a suitable path for a viewpoint over the 
image using the or each region of interest. The order 
can also be specified. As a specific example, in the 
group traversal class 13, the user may be prompted to 

25 select or deselect some of the identified index frames 
27a - 27d, to specify the order of the index frames, to 
vary the time for.which each index frame is specified, or 
to specify the nature of transition between each index 
frame. Although this may reduce the composition effects 

30 pre-specified in the program 11 , a user is able to make 
some choices to suit individual requirements. 
[0042] In a fourth embodiment, a number of separate 
video programmes are generated within a single image 
for subsequent editing to form an overall video pro- 

35 gramme. For example, a moving viewpoint could be au- 
tomatically generated for two or more different parts of 
the image. In the editing stage, the transition between 
the two programmes can be specified manually. 
[0043] In a fifth embodiment, the application program 

40 11 }s specifically arranged to analyse image data relat- 
ing to photographs, and uses motion (in this example, 
panning) to overcome some aspect ratio mismatches 
between the photograph and the visual display. This is 
useful since photographs tend to have different aspect 

45 ratios to conventional television screens. This means 
that if the whole photograph is displayed, there will be 
unused areas of the screen. This is a particular problem 
for portrait mode photographs. In this embodiment, the 
image data is analysed to detect the salient parts of the 

50 image. These are then used to form an overview frame. 
This overview frame forms the first and last frame of the 
video sequence. The application program 11 generates 
a video sequence which is a panning effect about the 
image in such a way that it ends with the salient image. 

55 The panning action ensures that the whole image can 
be viewed. 

[0044] in ail the above embodiments, use can be 
made of known visual techniques, such as fades, cuts, 
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and dissolves, to join separate video clips together. 
[0045] It should be appreciated that there can be pro- 
vided several implementation strategies for the compu- 
ter system 1 . The first case is where the image data is 
acquired and analysed in real-time and the video data 
generated on demand. The second case is where the 
viewpoint path, scale and frame rate information is 
stored after the analysis of the image is performed. In a 
third example, the whole of the image data can be sub- 
sampled to obtain a number of resolutions forming, ef- 
fectively, a pyramid of images, as well as storing all view- 
point information. Since sub-sampling is computational- 
ly expensive, caching them in advance is advanta- 
geous. The fourth case simply generates a complete 
video stream that can be played back sequentially. 
[0046] Such a method and system for displaying dig- 
ital images enables interesting and effective video se- 
quences to be generated from otherwise static images. 
Less computer memory is occupied with image data 
which represents those areas which are of little of no 
interest to the viewer. 



Claims 

1. A method of displaying a digital image, the method 
comprising: acquiring a set of image data repre- 
sentative of a displayable static image; using a 
processing means (3) to perform an analysis of the 
image data to identify characteristics of the image 
content; and generating, in the processing means, 
a set of video data for output to a display device (9) 
connected to the processing means, the video data 
representing displayable motion over the static im- 
age and being generated in accordance with the im- 
age content characteristics. 

2. A method according to claim 1 , wherein the step of 
performing an analysis of the image data comprises 
determining which of a number of predefined image 
characteristics are present in the image, and where- 
in the step of generating the video data comprises 
executing an algorithm associated with those char- 
acteristics identified, the algorithm defining a rule or 
rules for generating a moving viewpoint over the im- 
age for display. 

3. A method according to claim 2, wherein the step of 
performing an analysis of the image further com- 
prises identifying a predefined image class (13, 15, 
17) wherein, in that image class, sub-parts of the 
image have predefined characteristics, and estab- 
lishing index frames based on a close-up view of 
each identified sub-part, the step of generating the 
video data comprising executing an algorithm for 
determining a display path from one index frame to 
the next. 
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A method according to claim 3, wherein, in the step 
of generating the video data, the algorithm further 
determines at least one of (a) the order of the index 
frames to be displayed, (b) the amount of time for 
which each index frame is displayed, and (c)the na- 
ture of the transition between each index frame. 

A method according to claim 3 or claim 4, wherein 
the step of identifying the predefined image class 
(13, 15, 17) having sub-parts with predefined char- 
acteristics comprises identifying regions of interest 
and performing a feature recognition operation. 

A method according to claim 5, wherein the step of 
performing feature recognition identifies human fa- 
cial features, the step of establishing index frames 
thereafter comprising forming index frames based 
on a close-up view of the identified facial features. 

A method according to claim 6, wherein, having 
identified human facial features, the step of per- 
forming feature recognition further comprises com- 
paring the facial features with a database of pre- 
stored facial features such that the step of forming 
index frames is performed only for those facial fea- 
tures already present in the database. 

A method according to claim 6 or claim 7, wherein 
the step of generating the video data to establish a 
display path comprises determining the orientation 
of the facial features identified, and generating a 
display path which follows the general gaze direc- 
tion which the facial features exhibit. 

A method according to claim 2, wherein the step of 
performing an analysis of the stored image further 
comprises identifying a predefined image class (1 3, 
1 5, 17) wherein, in that image class, there is at least 
one dominant edge, line or curve, the step of gen- 
erating the video data comprising executing an al- 
gorithm for determining a display path following the 
at least one dominant edge, line or curve. 



10. A method according to claim 2, wherein the step of 
45 performing an analysis of the image further com- 
prises: (a) identifying a predefined image class (13, 
15, 17) wherein, in that image class, there are both 
(i) image sub-parts having predefined characteris- 
tics, and (ii) dominant edges, lines or curves; and 

50 (b) establishing index frames based on a close-up 
view of each identified image sub-part in (i), the step 
of generating the video data comprising executing 
an algorithm for determining a display path moving 
between each index frame and following the domi- 

55 nant edges, lines or curves. 

1 1 . A method according to any of claims 2 to 1 0, where- 
in, in the step of generating the video data, the al- 
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gorithm defines rules having a first level and at least 
one sub-level, the rules in the first level relating to 
identification of a predefined image class (13, 15, 
17) and the rules in the at least one sub-level relat- 
ing to options for generating the moving viewpoint 5 
for the image class identified, the method further 
comprising prompting the user manually to select 
an option in a sub-level. 

12. A method according to any preceding claim, where- io 
in the step of generating the video data comprises 
generating video data for a plurality of video sub- 
clips, each sub-clip representing displayable mo- 
tion over a different part of the static image, and 
wherein the method further comprises an editing is 
step for linking the sub-clips to form a second set of 
video data. 

13. A method according to any preceding claim, where- 
in the image data is representative of a displayable 20 
photograph. 

14. A method according to claim 13, wherein the gen- 
erated video data is representative of a panning mo- 
tion over the photograph, the initial and end frames 25 
representing salient parts of the image. 

15. A computer program (11 ) stored on a computer-us- 
able medium, the computer program comprising 
computer-readable instructions for causing the 30 
computer to execute the steps of the method ac- 
cording to any preceding claim. 

16. A computer system (1) comprising a processing 
means (3), a data port (7) and a video port, the 35 
processing means being arranged to receive image 
data representative of a displayable static image 
from the data port, and wherein the processing 
means is further arranged to access and to perform 

an analysis of the image data to identify character- 40 
istics of the image content and to generate a set of 
video data representing displayable motion overthe 
static image according to the image content char- 
acteristics, the processing means being arranged 
to output the video data to the video port for display. 

17. A method of displaying a digital image, the method 
comprising: inputting a set of image data represent- 
ative of a displayable static image into a processing 
means (3); using the processing means to automat- so 
ically perform an analysis of the image data thereby 

to identify characteristics of the image content; and 
generating, using the processing means, a set of 
video data for output to a display device (9) con- 
nected to the processing means, the video data rep- 55 
resenting displayable motion over the static image 
and being generated in accordance with the image 
content characteristics. 



18. A method of automatically generating a moving im- 
age from a static image, the method comprising: in- 
putting a set of image data representative of the 
static image into a processing means (3); using the 
processing means to automatically determine 
which of a number of predefined image character- 
istics are present in the image data; and generating, 
using the processing means, a set of video data for 
output to a display device (9) connected to the 
processing means, the video data representing dis- 
playable motion over the static image and being 
generated by means of the processing means exe- 
cuting an algorithm respectively associated with the 
or each image characteristic identified, the respec- 
tive algorithm defining a rule or rules for generating 
a moving viewpoint over the image. 

19. A method according to claim 18, wherein in the step 
of automatically determining which of a number of 
predefined image characteristics are present in the 
image data, the processing means (3) identifies at 
least one predefined image class (13, 15, 17), the 
or each image class being defined by image sub- 
parts having predefined characteristics, index 
frames thereafter being identified by the processing 
means, each index frame being a scaled view of 
each identified sub-part, the step of generating the 
video data comprising executing a respective algo- 
rithm for determining a display path from one index 
frame to the next. 
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